Differentially private genome data dissemination through top-down specialization
نویسندگان
چکیده
Advanced sequencing techniques make large genome data available at an unprecedented speed and reduced cost. Genome data sharing has the potential to facilitate significant medical breakthroughs. However, privacy concerns have impeded efficient genome data sharing. In this paper, we present a novel approach for disseminating genomic data while satisfying differential privacy. The proposed algorithm splits raw genome sequences into blocks, subdivides the blocks in a top-down fashion, and finally adds noise to counts to preserve privacy. The experimental results suggest that the proposed algorithm can retain certain data utility in terms of a high sensitivity.
منابع مشابه
A Scalable Two-Phase Top-Down Specialization Approach for Data Anonymization Using Map Reduce on Cloud
: More number of users requires cloud services to transfer private data like electronic health records and financial transaction records. A cloud computing services offers several flavors of virtual machines to handle large scale datasets. But centralized approaches are difficult in handling of large datasets. Data anonymization is used for privacy preservation techniques. It is challenged to m...
متن کاملData Anonymization of Vertically Partitioned Data Using Mapreduce on Cloud
In the world of computers, cloud services, on large scale, are being offered by service providers. User wishes to share some private information that has been stored on the cloud server due to various reasons such as data analysis, data mining and so on. These things bring up a concern about privacy. Privacy preservation may be attained by Anonymization data sets via normalization for satisfyin...
متن کاملDifferentially Private Set-Valued Data Release against Incremental Updates
Publication of the private set-valued data will provide enormous opportunities for counting queries and various data mining tasks. Compared to those previous methods based on partition-based privacy models (e.g., k-anonymity), differential privacy provides strong privacy guarantees against adversaries with arbitrary background knowledge. However, the existing solutions based on differential pri...
متن کاملA Scalable Two-Phase Top-DownSpecialization Approach for Data Anonymization Using MapReduce on Cloud
A large number of cloud services require users to share private data like electronic health records for data analysis or mining, bringing privacy concerns. Anonymizing data sets via generalization to satisfy certain privacy requirements such as k-anonymity is a widely used category of privacy preserving techniques. At present, the scale of data in many cloud applications increases tremendously ...
متن کاملBig Data Processing with Privacy Preserving Using Map Reduce on Cloud
A large number of cloud services require users to share private data like electronic health records for data analysis or mining, bringing privacy concerns. Anonymizing data sets via generalization to satisfy certain privacy requirements such as k-anonymity is a widely used category of privacy preserving techniques. At present, the scale of data in many cloud applications increases tremendously ...
متن کامل